Abstract

Spectra of identified charged hadrons are measured in pp collisions at the LHC for
√s = 0.9, 2.76, and 7 TeV. Charged pions, kaons, and protons in the transverse-momentum
range p_T ≈ 0.1-1.7 GeV/c and for rapidities |y| < 1 are identified via
their energy loss in the CMS silicon tracker. The average p_T increases rapidly with
the mass of the hadron and the event charged-particle multiplicity, independently of
the center-of-mass energy. The fully corrected p_T spectra and integrated yields are
compared to various tunes of the Pythia 6 and Pythia 8 event generators.

1 Introduction

The study of hadron production has a long history in high-energy particle and nuclear physics, 
as well as cosmic-ray physics. The absolute yields and the transverse momentum (p_T) spectra
of identified hadrons in high-energy hadron-hadron collisions are among the basic physical 
observables that can be used to test the predictions for non-perturbative quantum chromody- 
namics processes like hadronization and soft parton interactions, and their implementation in 
Monte Carlo (MC) event generators. The dependence of these quantities on the hardness of 
the pp collision provides valuable information on multi-parton interactions as well as on other 
final-state effects. In addition, the measurements of baryon (and notably proton) production
are not reproduced by the existing models, and more data at higher energy may help to improve
them. Spectra of identified particles in proton-proton (pp) collisions also constitute
an important reference for high-energy heavy-ion studies, where final-state effects are known 
to modify the spectral shape and yields of different hadron species. 

The present analysis focuses on the measurement of the p_T spectra of charged hadrons, identified
mostly via their energy deposits in silicon detectors, in pp collisions at √s = 0.9, 2.76,
and 7 TeV. In certain phase space regions, particles can be identified unambiguously while 
in other regions the energy loss measurements provide less discrimination power and more 
sophisticated methods are necessary. 

This paper is organized as follows. The Compact Muon Solenoid (CMS) detector, operating at 
the Large Hadron Collider (LHC), is described in Section 2. Elements of the data analysis, such
as event selection, tracking of charged particles, identification of interaction vertices, and
treatment of secondary particles, are discussed in Section 3. The applied energy loss
parametrization, the estimation of energy deposits in the silicon, and the calculation of the
energy loss rate of tracks are explained in Section 4. In Section 5 the various aspects of the
unfolding of particle yields are described. After a detailed discussion of the applied
corrections (Section 6), the final results are shown in Section 7 and summarized in the
conclusions.

2 The CMS detector 

The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diam- 
eter. Within the field volume are the silicon pixel and strip tracker, the crystal electromagnetic 
calorimeter, and the brass/scintillator hadron calorimeter. In addition to the barrel and endcap
detectors, CMS has extensive forward calorimetry. CMS uses a right-handed coordinate sys- 
tem, with the origin at the nominal interaction point and the z axis along the counterclockwise 
beam direction. The pseudorapidity and rapidity of a particle with energy E, momentum p, 
and momentum along the z axis p_z are defined as η = −ln tan(θ/2), where θ is the polar
angle with respect to the z axis, and y = (1/2) ln[(E + p_z)/(E − p_z)], respectively. A more
detailed description of CMS can be found in Ref. [1].
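
The two definitions above can be illustrated with a minimal Python sketch. The function names and the charged-pion mass constant are illustrative assumptions and are not taken from the analysis software; natural units (GeV) are assumed.

```python
import math

PION_MASS = 0.13957  # GeV/c^2, charged pion; assumed constant for the usage example


def polar_angle(px, py, pz):
    """Polar angle theta with respect to the z axis, in radians."""
    return math.atan2(math.hypot(px, py), pz)


def pseudorapidity(theta):
    """eta = -ln tan(theta / 2)."""
    return -math.log(math.tan(theta / 2.0))


def energy(px, py, pz, mass):
    """E = sqrt(|p|^2 + m^2) in natural units."""
    return math.sqrt(px * px + py * py + pz * pz + mass * mass)


def rapidity(e, pz):
    """y = (1/2) ln[(E + p_z) / (E - p_z)]."""
    return 0.5 * math.log((e + pz) / (e - pz))


# Usage: a low-momentum pion emitted at 45 degrees to the beam axis.
eta = pseudorapidity(polar_angle(0.1, 0.0, 0.1))
y = rapidity(energy(0.1, 0.0, 0.1, PION_MASS), 0.1)
```

For a massless particle the two quantities coincide; for a massive particle at low momentum |y| is smaller than |η|, which is why a selection in y covers a different region for each hadron species.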

Two elements of the CMS detector monitoring system, the beam scintillator counters (BSCs) 
and the beam pick-up timing for the experiments (BPTX) devices, were used to trigger the de- 
tector readout. The two BSCs are located at a distance of ±10.86 m from the nominal interaction 
point (IP) and are sensitive to particles in the |η| range from 3.23 to 4.65. Each BSC is a set of 16
scintillator tiles. The BSC elements are designed to provide hit and coincidence rates. The two 
BPTX devices, located around the beam pipe at a distance of 175 m from the IP on either side, 
are designed to provide precise information on the bunch structure and timing of the incoming 
beam. A steel/quartz-fibre forward calorimeter (HF) covers the region of |η| between about 3.0
and 5.0. The HF tower segmentation in η and azimuthal angle φ is 0.175 × 0.175, except for |η|
above 4.7 where the segmentation is 0.175 × 0.35.

Figure 1: Left: values of the most probable energy loss rate ε, at the reference path length
of 450 μm in silicon, for electrons, pions, kaons and protons [2]. The inset shows the region
1 < p < 5 GeV/c. Right: for each particle, the accessible (y, p_T) area is contained between the
upper thicker (determined by particle identification capabilities) and the lower thinner lines
(determined by acceptance and efficiency). More details are given in Section

The tracker measures charged particles within the pseudorapidity range |η| < 2.4. It has 1440
silicon pixel and 15 148 silicon strip detector modules and is located in the 3.8 T field of the
solenoid. The pixel detector [3] consists of three barrel layers (PXB) at radii of 4.4, 7.3, and
10.2 cm as well as two endcap disks (PXF) on each side of the PXB. The detector units are
segmented n-on-n silicon sensors of 285 μm thickness. Each readout chip serves a 52 × 80 array
of 150 μm × 100 μm pixels. In the data acquisition system, zero suppression is performed with
adjustable thresholds for each pixel. Offline, pixel clusters are formed from adjacent pixels, 
including both side-by-side and corner-by-corner adjacent pixels. The strip tracker [4] employs 
p-in-n silicon wafers. It is partitioned into different substructures: the tracker inner barrel (TIB) 
and the tracker inner disks (TID) are the innermost part with 320 μm thick sensors, surrounded
by the tracker outer barrel (TOB) with 500 μm thick sensors. On both sides, the tracker is
completed by endcaps with a mixture of 320 μm thick sensors (TEC3) and 500 μm thick sensors
(TEC5). The first two layers of TIB and TOB and some of the TID and TEC contain "stereo" 
modules: two silicon modules mounted back-to-back with a 100 mrad angle to provide two- 
dimensional hit resolution. Each readout chip serves 128 strips. Algorithms are run in the 
Front-End Drivers (FED) to perform pedestal subtraction, common-mode subtraction and zero 
suppression. Only a small fraction of the channels are read out in one event. Offline, clusters 
are formed by combining contiguous hits. The tracker provides an impact-parameter resolution
of ~15 μm and an absolute p_T resolution of about 0.02 GeV/c in the range p_T ≈ 0.1-2 GeV/c,
of relevance here.



2.1 Particle identification capabilities 

The identification of charged particles is often based on the relationship between energy loss
rate and total momentum (Fig. 1, left). Particle reconstruction at CMS is limited by the
acceptance (C_a) of the tracker (|η| < 2.4) and by the low tracking efficiency (C_e) at low
momentum (p > 0.05, 0.10, 0.20, and 0.35 GeV/c for e, π, K, and p, respectively), while particle
identification capabilities are restricted to p < 0.15 GeV/c for electrons, p < 1.20 GeV/c for
pions, p < 1.05 GeV/c for kaons, and p < 1.70 GeV/c for protons (Fig. 1, left). Pions are
accessible up to a higher momentum than kaons because of their high relative abundance, as
discussed in Section 5.2. The (y, p_T) region where pions, kaons and protons can all be
identified is visible in the right panel of Fig. 1. The region −1 < y < 1 was chosen for the
measurement, since it maximizes the p_T coverage.



3 Data analysis 

The 0.9 and 7 TeV data were taken during the initial low multiple-interaction rate (low "pileup") 
runs in early 2010, while the 2.76 TeV data were collected in early 2011. The requirement of 
similar amounts of produced particles at the three center-of-mass energies and that of small 
average number of pileup interactions led to 8.80, 6.74 and 6.20 million events for √s = 0.9 TeV,
2.76 TeV, and 7 TeV, respectively. The corresponding integrated luminosities are 0.227 ± 0.024 nb⁻¹,
0.143 ± 0.008 nb⁻¹ and 0.115 ± 0.005 nb⁻¹ [5, 6], respectively.

3.1 Event selection and related corrections 

The event selection consists of the following requirements: 

• at the trigger level, the coincidence of signals from both BPTX devices, indicating 
the presence of both proton bunches crossing the interaction point, along with the 
presence of signals from either of the BSCs; 

• offline, the presence of at least one tower with energy above 3 GeV in each of the HF
calorimeters; at least one reconstructed interaction vertex (Section 3.3); the suppression
of beam-halo and beam-induced background events, which usually produce an
anomalously large number of pixel hits [7].

The efficiencies for event selection, tracking, and vertexing were evaluated by means of
simulated event samples produced with the Pythia 6.420 [8] MC event generator at each of the
three center-of-mass energies. The events were reconstructed in the same way as the collision
data. The Pythia tunes D6T [9], Z1, and Z2 [10] were chosen, since they describe the
measured event properties reasonably well, notably the reconstructed track multiplicity
distribution. Tune D6T is a pre-LHC tune with virtuality-ordered showers using the CTEQ6L
parton distribution functions (PDF). The tunes Z1 and Z2 are based on the early LHC data and
generate p_T-ordered showers using the CTEQ5L and CTEQ6L PDFs, respectively.

The final results were corrected to a particle-level selection, which is very similar to the
actual selection described above: at least one particle (τ > 10⁻¹⁸ s) with E > 3 GeV in the range
−5 < η < −3 and one in the range 3 < η < 5; this selection is referred to in the following
as "double-sided" (DS) selection. The overall efficiency of the DS selection for a zero-bias
sample, according to Pythia, is about 66-72% (0.9 TeV), 70-76% (2.76 TeV), and 73-78% (7 TeV).
The ranges given represent the spread of the predictions of the different tunes. Mostly
non-diffractive (ND) events are selected, with efficiencies in the 88-98% range, but smaller
fractions of double-diffractive (DD) events (32-38%) and single-diffractive dissociation (SD)
events (13-26%) are accepted as well. About 90% of the selected events are ND, while the rest are
DD or SD, in about equal measure. In order to compare to measurements with a
non-single-diffractive (NSD) selection, the particle yields given in this study should be divided
by factors of 0.86, 0.89, and 0.91 according to Pythia, for √s = 0.9, 2.76, and 7 TeV, respectively.
The systematic uncertainty on these numbers due to the tune dependence is about 3%.
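
The conversion from the DS to the NSD selection described above amounts to a single division. The sketch below uses the three factors quoted in the text; the function name is hypothetical, and treating the 3% as a relative uncertainty that carries over unchanged to the converted yield is an assumption of this example.

```python
# Pythia-based DS -> NSD factors quoted in the text, keyed by sqrt(s) in TeV.
NSD_FACTOR = {0.9: 0.86, 2.76: 0.89, 7.0: 0.91}

# Assumption: the quoted ~3% tune dependence is a relative uncertainty.
REL_TUNE_UNCERTAINTY = 0.03


def ds_to_nsd(yield_ds, sqrt_s_tev):
    """Return (NSD-equivalent yield, tune-related uncertainty) for a DS yield."""
    value = yield_ds / NSD_FACTOR[sqrt_s_tev]
    return value, value * REL_TUNE_UNCERTAINTY
```

Because the factors are below unity, the NSD-equivalent yield is always larger than the DS yield.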

Figure 2: The ratio of selected events to double-sided events (ratio of the corresponding
efficiencies in the inelastic sample), according to the Pythia 6 tunes (0.9 TeV: D6T, 2.76 TeV: Z2,
7 TeV: Z1), as a function of the reconstructed primary charged track multiplicity.

The ratios of the data selection efficiency to the DS selection efficiency are shown as a function 
of the reconstructed track multiplicity in Fig. 2 for the three center-of-mass energies studied.
The ratios are used to correct the measured events; they are approximately independent of 
the Pythia tune. The different behavior of the 2.76 TeV data results from a change in the HF 
configuration in 2011. The results are also corrected for the fraction of DS events without a 
reconstructed track. This fraction, as given by the simulation, is about 4%, 3%, and 2.5% for 
0.9, 2.76, and 7 TeV, respectively. Since these events do not contain reconstructed tracks, only 
the event yield must be corrected. 

3.2 Tracking of charged particles 

The extrapolation of particle spectra into the unmeasured regions is model dependent,
particularly at low p_T. A good measurement therefore requires reliable track reconstruction down
to the lowest possible p_T. The present analysis extends to p_T ≈ 0.1 GeV/c by exploiting
special tracking algorithms [11], used in previous studies [7, 12], to provide high reconstruction
efficiency and low background rate.

The performance of the charged-particle tracking was quantified in terms of the geometrical
acceptance, the tracking efficiency, and the fraction of misreconstructed tracks; all these
quantities were evaluated by means of simulated events and validated in previous studies [7, 12].
The acceptance of the tracker (when at least two pixel hits are required) is flat in the region
−2 < η < 2 and p_T > 0.4 GeV/c, and its value is about 96-98%. The loss of acceptance at
p_T < 0.4 GeV/c is caused by energy loss and multiple scattering of particles, which both
depend on the particle mass. Likewise, the reconstruction efficiency is about 80-90%, degrading
at low p_T, also in a mass-dependent way. The misreconstructed-track rate (C_f) is very small,
reaching 0.3% only for p_T < 0.25 GeV/c; it rises slightly above 2 GeV/c because of the steeply
falling p_T distribution. The probability of reconstructing multiple tracks (C_m) from a true
single track is about 0.1%, mostly due to particles spiralling in the strong magnetic field. The
efficiencies and background rates largely factorize in η and p_T, but for the final corrections an
(η, p_T) grid is used.




3.3 Vertexing and secondary particles 

Table 1: Standard deviation of the vertex z coordinate distribution (σ_z) and average number
of pileup events for the three center-of-mass energies studied. The last two columns show the
estimated fraction of merged and split vertices. More details are given in the text.

Energy      σ_z        ⟨pileup⟩    Merged      Split
0.9 TeV     6.67 cm    0.016       5 · 10⁻⁴    ~10⁻³
2.76 TeV    6.23 cm    0.094       3 · 10⁻³    ~10⁻³
7 TeV       3.08 cm    0.009       6 · 10⁻⁴    ~10⁻³


The region where pp collisions occur (beam spot) is well measured by reconstructing vertices 
from many events. Since the bunches are very narrow, the transverse position of the interaction 
vertices is well constrained; conversely their z coordinates are spread over a relatively long 
distance and must be determined on an event-by-event basis. Reconstructed tracks are used 
for determining the vertex position if they have p_T > 0.1 GeV/c and originate from the vicinity
of the beam spot, i.e. their transverse impact parameter satisfies the condition d_T < 3σ_T; here
σ_T is the quadratic sum of the uncertainty of d_T and the RMS of the beam spot distribution in
the transverse plane. The agglomerative vertex-reconstruction algorithm [13] was used, with
the z coordinates (and their uncertainty) of the tracks at the point of closest approach to the 
beam axis as input. This algorithm keeps clustering tracks into vertices as long as the smallest 
distance between the vertices of the remaining groups of tracks, divided by its uncertainty, 
is below 35. Simulations indicate that this value minimizes the number of merged vertices 
(vertices with tracks from two or more true vertices) and split vertices (two or more vertices 
with tracks from a single true vertex). For single-vertex events, there is no lower limit on the 
number of tracks associated to the vertex. If multiple vertices are present, only those with at 
least three tracks are kept. 
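
The clustering logic described above can be sketched in a few lines of Python. This is a simplified illustration and not the algorithm of Ref. [13]: the inverse-variance weighted merge, the use of the quadratic sum of the two uncertainties for the distance, and all names are assumptions of this example. Only the threshold of 35 and the three-track requirement come from the text.

```python
import math


def cluster_vertices(tracks, max_norm_dist=35.0, min_tracks_multi=3):
    """Agglomerative 1D clustering of track z positions.

    tracks: list of (z, sigma_z) at the point of closest approach to the beam axis.
    Returns a list of dicts with keys "z", "var", "n", sorted by z.
    """
    clusters = [{"z": z, "var": s * s, "n": 1} for z, s in tracks]
    while len(clusters) > 1:
        # Find the pair with the smallest distance divided by its uncertainty.
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                a, b = clusters[i], clusters[j]
                d = abs(a["z"] - b["z"]) / math.sqrt(a["var"] + b["var"])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        if d >= max_norm_dist:
            break  # no remaining pair is close enough to merge
        a, b = clusters[i], clusters[j]
        wa, wb = 1.0 / a["var"], 1.0 / b["var"]
        merged = {
            "z": (wa * a["z"] + wb * b["z"]) / (wa + wb),
            "var": 1.0 / (wa + wb),
            "n": a["n"] + b["n"],
        }
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    # With several vertices, keep only those with at least three tracks.
    if len(clusters) > 1:
        clusters = [c for c in clusters if c["n"] >= min_tracks_multi]
    return sorted(clusters, key=lambda c: c["z"])


# Usage: two groups of tracks and one isolated track (z and sigma_z in cm).
example_tracks = [(0.00, 0.01), (0.01, 0.01), (-0.01, 0.01), (0.005, 0.01),
                  (5.00, 0.01), (5.01, 0.01), (4.99, 0.01), (-8.00, 0.01)]
vertices = cluster_vertices(example_tracks)
```

In this toy input the isolated track forms a one-track vertex, which is dropped because more than one vertex is present.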

The distribution of the z coordinates of the reconstructed primary vertices is Gaussian, with 
standard deviations of 6 cm at 0.9 and 2.76 TeV, and 3 cm at 7 TeV. The simulated data were 
reweighted so as to have the same vertex z coordinate distributions as the data. The distribution 
of the distance Δz between vertices was used to quantify the effect of pileup and the quality of
vertex reconstruction. There is an empty region around Δz = 0, which corresponds to cases in
which two true vertices are closer than about 0.4 cm to each other and are merged during vertex
reconstruction. The Δz distribution was therefore used to determine the fraction of merged
(and thus lost) vertices, and to estimate the fraction of split vertices (via the non-Gaussian 
tails). Both effects are at the 0.1% level and were neglected in this study. 

The number of primary vertices in a bunch crossing follows a Poisson distribution. The fraction 
of events with more than one vertex (due to pileup) is small in the 0.9 and 7 TeV data (1.6% and 
0.9%, respectively), but is 9.4% at 2.76 TeV. The interaction-region and pileup parameters are 
summarized in Table 1. For the 0.9 and 2.76 TeV data, bunch crossings with either one or two
reconstructed vertices were used, while for the 7 TeV data the analysis was restricted to events 
with a single reconstructed vertex to suppress the larger background from pileup, split and 
merged vertices. 

The hadron spectra were corrected for particles of non-primary origin. The main source of
secondary particles is the feed-down from weakly decaying particles, mostly K0_S, Λ/Λ̄, and

. While the correction (C_s) is around 1% for pions, it is up to 15% for protons with p_T ≈
0.2 GeV/c. This is expected because the daughter p or p̄ takes most of the momentum of the
primary Λ/Λ̄, and therefore has a higher probability of being (mistakenly) fitted to the primary
vertex than a pion from a K0_S decay. Since none of the weakly decaying particles mentioned
decay into kaons, the correction for kaons is small. The corrections were derived from Pythia
and cross-checked with data [14] by comparing measured and predicted spectra of particles.
While data and simulation generally agree, the Λ/Λ̄ correction had to be multiplied by a factor
of 1.6.

Table 2: Properties of several strip subdetectors evaluated by using hits on tracks with
close-to-normal incidence: readout threshold t, coupling parameter α_c, standard deviation σ_n of
the Gaussian noise. The three values given for α_c and σ_n are for the 0.9, 2.76, and 7 TeV datasets.

Detector    t [keV]    α_c                    σ_n [keV]
TIB         9.6        0.091, 0.077, 0.096    6.9, 7.0, 6.9
TID         8.5        0.076, 0.068, 0.081    7.2, 7.6, 7.2
TOB         15.3       0.116, 0.094, 0.124    9.2, 10.3, 9.6
TEC3        8.5        0.059, 0.059, 0.072    6.3, 6.9, 6.4
TEC5        14.1       0.094, 0.086, 0.120    8.6, 9.7, 9.0

For p < 0.15 GeV/c, electrons can be clearly identified. According to Pythia, the overall e±
contamination of the hadron yields is below 0.2%. Although muons cannot be separated from 
pions, their fraction is negligible, below 0.05%. Since both contaminations are small no correc- 
tions were applied. 

4 Energy deposits and estimation of energy loss rate 

The silicon layers of the tracker are thin and the energy depositions do not follow a Gaussian 
distribution, but exhibit a long tail at high values. Ideally, the estimates of the energy loss rate 
should not depend on the path lengths of the track through the sensitive parts of the silicon 
or on the detector details. However, this is not the case with the often used truncated, power,
or weighted means of the differential deposits, ΔE/Δx. Some of the dependence on the path
length can be corrected for, but a method based on the proper knowledge of the underlying 
physical processes is preferable. 

In the present paper a novel analytical parametrization [15] has been used to approximate 
the energy loss of charged particles. The method provides the probability density p(y|ε, l) of
energy deposit y, if the most probable energy loss rate ε at a reference path length l_0 and the
path length l are known. The method can be used in conjunction with a maximum likelihood
estimation. The deposited energy is estimated from the measured charge deposits in individual 
channels (pixels or strips) contributing to hit clusters. Deposits below the readout threshold 
or above the saturation level of the readout electronics are estimated from the length of the 
track segment in the silicon. This results in a wider accessible energy deposit range and better 
particle identification power. The method can be applied to the energy loss rate estimation of 
tracks and to calibrate the gain of the tracker detector front-end electronics. In this analysis, for 
each track, the estimated ε value at l_0 = 450 μm was used for particle identification and yield
determination. 

For pixel clusters, the energy deposits (and their variances) were calculated as the sum of in- 
dividual pixel deposits (and variances). The noise contribution is Gaussian, with a standard 
deviation σ_n ≈ 10 keV per pixel. In the case of strips, the energy deposits were corrected for
capacitive coupling and cross-talk between neighboring strips. The readout threshold t, the
coupling parameter α_c, and the standard deviation σ_n of the Gaussian noise for strips were
determined from the data, by means of tracks with close-to-normal incidence (Table 2).

Table 3: Tight requirements for approximate particle identification. All ε values are functions
of p. Subscripts π, K, and p refer to the most probable value for a given particle species, as
expected from simulation.

Particle    Momentum                 Most probable energy loss rate
pion        0.15 < p < 0.70 GeV/c    ε < (ε_π + ε_K)/2
kaon        p < 0.70 GeV/c           (ε_π + ε_K)/2 < ε < (ε_K + ε_p)/2
proton      p < 1.40 GeV/c           (ε_K + ε_p)/2 < ε


4.1 Detector gain calibration with tracks 

For an accurate determination of ε, it is crucial to calibrate the response of all readout chips. It is
also important to compare the measured energy deposit spectra to the energy loss parametriza- 
tion, and introduce corrections if needed. 

The value of ε was estimated for each track using an initial gain calibration of the pixel and
strip readout chips. Approximate particle identification was performed starting from a sample
of identified tracks selected as follows: a track was identified as pion, kaon, or proton if its
momentum p and most probable energy loss rate ε satisfied the tight requirements listed in
Table 3. In addition, tracks with p > 2 GeV/c, or ε < 3.2 MeV/cm, or from identified K0_S two-body
charged decays were assumed to be pions. Identified electrons were not used. The expected
ε, path length l, and energy deposit y were collected for each hit, and stored for every readout
chip separately. For each chip, the joint energy deposit log-likelihood, −2 Σ_j log p(g · y_j | ε_j, l_j),
of all selected hits (index j) was minimized by varying the multiplicative gain correction g. At
each center-of-mass energy, approximately 10% of the data were enough to perform the gain
calibration with sufficient resolution. The expected gain uncertainty is 0.5% on average for
pixel chips and 0.5-2% for strip readout chips, depending on the chip position.
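
The per-chip gain fit described above is a one-parameter minimization. The sketch below illustrates only that structure: the density used here is a hypothetical Gaussian stand-in centred on ε·l, not the analytical parametrization of Ref. [15], and the golden-section search and all names are choices made for this example.

```python
import math


def neg2_log_density(y, eps, length):
    """Hypothetical stand-in for -2 log p(y | eps, l): a Gaussian centred on eps * l.

    The real analysis uses the parametrization of Ref. [15] instead.
    """
    mu = eps * length
    sigma = 0.15 * mu  # assumed relative width, for illustration only
    return ((y - mu) / sigma) ** 2 + 2.0 * math.log(sigma)


def calibrate_gain(hits, lo=0.5, hi=2.0, tol=1e-7):
    """Minimize -2 sum_j log p(g * y_j | eps_j, l_j) over the gain correction g.

    hits: list of (y, eps, length) for one readout chip.
    Uses a golden-section search on the interval [lo, hi].
    """
    def objective(g):
        return sum(neg2_log_density(g * y, eps, length) for y, eps, length in hits)

    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    while b - a > tol:
        if objective(c) < objective(d):
            b = d
        else:
            a = c
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
    return 0.5 * (a + b)


# Usage: a chip whose response is too low by a factor 1.25 (eps in MeV/cm, l in cm).
expected_hits = [(3.0, 0.030), (4.5, 0.045), (8.0, 0.032), (12.0, 0.060)]
chip_hits = [(eps * length / 1.25, eps, length) for eps, length in expected_hits]
gain = calibrate_gain(chip_hits)
```

With the stand-in density the objective is quadratic in g, so the search recovers the injected factor.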

After the detector gain calibration, the energy loss parametrization was validated with particles
identified by the selection discussed above. As examples, the measured energy deposit
distributions of positively charged hadrons for different path lengths at βγ = p/m = 1.39 and 3.49
are shown for PXB and TIB in Fig. 3 for the 7 TeV dataset. Similar results were obtained from
the data taken at 0.9 and 2.76 TeV. Separate corrections for positive and negative particles were
necessary since some effects are not charge symmetric. The energy loss parametrization [15]
(solid lines in the figures) gives a good description of the data. In order to describe deviations
from the parametrization, we allow for an affine transformation of the theoretical distributions
(log ε → α log ε + δ), the parameters of which are determined from the hit-level residuals. The
scale factors (α) and the shifts (δ) are both functions of the βγ value of the particle and the
length of the track segment l in silicon. The scale factors are around unity for most βγ values
and increase to 1.2-1.4 for βγ < 2. Shifts (δ) are generally a few keV with deviations up to
10 keV for βγ < 1. A slight path-length dependence was found for both scale factors and shifts.
The observed behavior of these hit-level residuals, as a function of βγ and l, was parametrized
with polynomials. These corrections were applied to individual hits during the determination
of the log ε templates, as described below.

Figure 3: An example from the 7 TeV dataset of the validation of the energy deposit
parametrization. The measured energy deposit distributions of identified hadrons at given
βγ values in the PXB (left) and TIB (right) are shown. Values are given for silicon path lengths
of l = 270, 300, 450, 600, 750, 900 and 1050 μm, together with predictions of the
parametrization (curves) already containing the hit-level corrections (scale factors and shifts).
The average cluster noise σ_n is also given.

4.2 Estimation of the most probable energy loss rate for tracks 

The best value of ε for each track was calculated with the corrected energy deposits. The log ε
values in (η, p_T) bins were then used in the yield unfolding (Section 5). Removal of hits with
incompatible energy deposits and the creation of fit templates, giving the expected log ε
distributions for all particle species (electrons, pions, kaons, and protons), are discussed here.

The value of ε was estimated by minimizing the joint energy deposit negative log-likelihood of
all hits on the trajectory (index i), χ² = −2 Σ_i log p(y_i | ε, l_i). Distributions of log ε as a
function of total momentum p are plotted in Fig. 4 for electrons, pions, kaons, and protons, and
compared to the predictions of the energy loss method. The observed deviations were taken into
account by means of track-level corrections (cf. Section 5).

Since the association of hits to tracks is not always unambiguous, some hits, usually from noise 
or hit overlap, do not belong to the actual track. These false hits, or "outliers", can be removed. 
The tracks considered for hit removal were those with at least three hits and for which the joint
energy-deposit χ² is larger than 1.3 n_hits + 4√(1.3 n_hits), where n_hits denotes the number of hits
on the track. If the exclusion of a hit decreased the χ² by at least 12, the hit was removed. At
most one hit was removed; this affected about 1.5% of the tracks. If there is an outlier, it is
usually the hit with the lowest ΔE/Δx value.
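
The removal rule above can be written down as a short sketch. The χ² function is passed in as a callable; `toy_chi2` is a hypothetical stand-in (squared deviations from a re-fitted mean) and not the energy-deposit likelihood. Choosing the hit with the largest χ² decrease when several hits qualify is also an assumption of this example.

```python
import math


def needs_outlier_check(chi2, n_hits):
    """True if the track has >= 3 hits and chi2 > 1.3 n + 4 sqrt(1.3 n)."""
    return n_hits >= 3 and chi2 > 1.3 * n_hits + 4.0 * math.sqrt(1.3 * n_hits)


def remove_outlier(hits, chi2_fn, min_drop=12.0):
    """Remove at most one hit, if excluding it lowers chi2 by at least min_drop."""
    chi2_all = chi2_fn(hits)
    if not needs_outlier_check(chi2_all, len(hits)):
        return list(hits)
    best_drop, best_index = 0.0, None
    for i in range(len(hits)):
        reduced = hits[:i] + hits[i + 1:]
        drop = chi2_all - chi2_fn(reduced)
        if drop > best_drop:
            best_drop, best_index = drop, i
    if best_index is not None and best_drop >= min_drop:
        return hits[:best_index] + hits[best_index + 1:]
    return list(hits)


def toy_chi2(values, sigma=0.5):
    """Hypothetical chi2: squared deviations from the re-fitted mean, in units of sigma."""
    mean = sum(values) / len(values)
    return sum(((v - mean) / sigma) ** 2 for v in values)


# Usage: four compatible deposits and one incompatible one.
cleaned = remove_outlier([1.0, 1.1, 0.9, 1.0, 5.0], toy_chi2)
```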

In addition to the most probable value of log ε, the shape of the log ε distribution was also
determined from the data. The template distribution for a given particle species was built from
tracks with estimated ε values within three standard deviations of the theoretical value at a
given βγ. All kinematical parameters and hit-related observables were kept, but the energy
deposits were re-generated by sampling from the analytical parametrization. This procedure 
exploits the success of the method at the hit level to ensure a meaningful template determina- 
tion, even for tracks with very few hits. 

Figure 4: Distribution of log ε values as a function of total momentum p for the 2.76 TeV dataset,
for positive (left) and negative particles (right). The z scale is shown in arbitrary units and is
linear. The curves show the expected log ε for electrons, pions, kaons, and protons [2].



Figure 5: Left: invariant mass distribution of K0_S, Λ/Λ̄, and γ candidates. The K0_S histogram is
multiplied by 0.2. Vertical arrows denote the chosen mass limits for candidate selection. Right:
example distribution of log ε in a narrow momentum slice at p = 0.80 GeV/c for the high-purity
pion sample. Curves are template fits to the data, with scale factors (α) and shifts (δ) also given.
The inset shows the distributions with a logarithmic vertical scale. Both plots are from data at
7 TeV center-of-mass energy.

5 Fitting the log ε distributions

As seen in Fig. 3, low-momentum particles can be identified unambiguously and can therefore
be counted. Conversely, at high momentum, the log ε bands overlap (above about 0.5 GeV/c for
pions and kaons, and 1.2 GeV/c for protons); the particle yields therefore need to be determined
by means of a series of template fits in bins of η and p_T. This is described in the following.

Figure 6: Examples of log ε distributions (symbols) for the 7 TeV dataset at η = 0.35, p_T =
0.675 GeV/c, and corresponding template fits (solid curves represent the fit for the n_hits values
indicated on the right, lighter dashed curves are for intermediate n_hits). The most probable
values for pions (π), kaons (K), and protons (p) are indicated. Left: distributions in n_hits slices.
The points and the curves were scaled down by factors of 10^(−n_hits) for better visibility, with
n_hits = 1 at the top. Right: distributions in track-fit χ²/ndf slices, integrated over all n_hits. The
points and the curves were scaled down by factors of 1, 10, and 100 for better visibility, with
the lowest χ²/ndf slice at the top.

The starting point is the histogram of estimated log ε values m_i in a given (η, p_T) bin (i runs
over the histogram bins), along with normalised template distributions x_ki, with k indicating
electron, pion, kaon, or proton. The goal is to determine the yield of each particle type (a_k)
contributing to the measured distribution. Since the entries in a histogram are Poisson-distributed,
the corresponding log-likelihood function to minimize is

(1)

where t_i = Σ_k a_k x_ki contains the quantity to be fitted. The minimum for this non-linear
expression can be found by using Newton's method [16], usually within three iterations. Although
the templates describe the measured log ε distributions reasonably well, for a precision
measurement further (track-level) corrections are needed to account for the remaining discrepancies
between data and simulation. Hence, we allow for an affine transformation of the templates
with scale factors and shifts that depend on η and p_T, the particle charge, and the particle mass.
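
A template fit of this kind can be sketched as follows. The objective used here, χ² = 2 Σ_i [t_i − m_i + m_i ln(m_i/t_i)], is the standard Poisson likelihood chi-square and is an assumption of this sketch rather than an expression taken from the text. The step-halving safeguard, the starting point, and all function names are likewise illustrative, and the track-level affine corrections are not included.

```python
import math


def solve_linear(matrix, rhs):
    """Solve matrix * x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(n):
            if row != col:
                factor = aug[row][col] / aug[col][col]
                for c in range(col, n + 1):
                    aug[row][c] -= factor * aug[col][c]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def expected(yields, templates):
    """t_i = sum_k a_k * x_ki for every histogram bin i."""
    n_bins = len(templates[0])
    return [sum(a * x[i] for a, x in zip(yields, templates)) for i in range(n_bins)]


def poisson_chi2(counts, yields, templates):
    """Assumed objective: 2 * sum_i [t_i - m_i + m_i * ln(m_i / t_i)]."""
    total = 0.0
    for m, t in zip(counts, expected(yields, templates)):
        total += 2.0 * (t - m)
        if m > 0:
            total += 2.0 * m * math.log(m / t)
    return total


def fit_yields(counts, templates, n_iter=50):
    """Newton iterations on the yields a_k, starting from equal yields."""
    n_k = len(templates)
    bins = range(len(counts))
    yields = [sum(counts) / n_k] * n_k
    for _ in range(n_iter):
        t = expected(yields, templates)
        grad = [2.0 * sum(x[i] * (1.0 - counts[i] / t[i]) for i in bins)
                for x in templates]
        hess = [[2.0 * sum(xk[i] * xl[i] * counts[i] / t[i] ** 2 for i in bins)
                 for xl in templates] for xk in templates]
        step = solve_linear(hess, [-g for g in grad])
        # Safeguard (assumption): shorten the step until all t_i stay positive.
        scale = 1.0
        trial = [a + scale * s for a, s in zip(yields, step)]
        while min(expected(trial, templates)) <= 0.0:
            scale *= 0.5
            trial = [a + scale * s for a, s in zip(yields, step)]
        yields = trial
    return yields


# Usage: two normalised toy templates and a histogram built from yields 1000 and 200.
toy_templates = [[0.5, 0.3, 0.15, 0.05], [0.05, 0.15, 0.3, 0.5]]
toy_counts = [510.0, 330.0, 210.0, 150.0]
fitted = fit_yields(toy_counts, toy_templates)
```

Starting from equal yields, this toy case needs more than the three iterations quoted in the text; a starting point close to the solution converges faster.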

For a less biased determination of track-level corrections, enhanced samples of each particle
type are necessary. For electrons and positrons, photon conversions in the beam-pipe or in
the first pixel layer were used. For high-purity π and enhanced p samples, weakly decaying
hadrons were selected (K0_S, Λ/Λ̄). Both photon conversions and weak decays were
reconstructed by means of a simple neutral-decay finder, followed by a narrow mass cut.
Invariant-mass distributions of the selected candidates are shown in the left panel of Fig. 5.
A sample with enhanced kaon content was obtained by tagging K± mesons (with the requirements
listed in Table 3) and looking for an opposite-sign particle which, with the kaon mass assumption,
would give an invariant mass close to that of the φ(1020), within 2Γ = 8.52 MeV/c². An
example distribution of log ε for the high-purity pion sample in a narrow momentum slice is
plotted in the right panel of Fig. 5.
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5.1 Additional information for particle identification 

At low momentum, the $\log\varepsilon$ templates for electrons and pions can be compared to the
$\log\varepsilon$ distributions of high-purity samples, but this type of validation does not work
at higher momenta because of lack of statistics; for the same reason, it does not work for kaons
and protons. It is therefore important to study the $\log\varepsilon$ distributions in more
detail: they contain useful additional information that can be used to determine the track-level
corrections, thus reducing the systematic uncertainties of the extracted yields. This is
discussed in the following.

a) Fitting $\log\varepsilon$ in $n_\mathrm{hits}$ slices. The $n_\mathrm{hits}$ distribution in a
given $(\eta, p_\mathrm{T})$ bin is different for different particle types. Pions have a higher
average number of hits per track, with fewer hits for kaons and even fewer for protons. These
differences are due to physical effects, such as the different inelastic hadron-nucleon cross
section, multiple Coulomb scattering, and decay in flight. It is therefore advantageous to
simultaneously perform differential fits in $n_\mathrm{hits}$ bins (left panel of Fig. 6).

b) Fitting $\log\varepsilon$ in track-fit $\chi^2/\mathrm{ndf}$ slices. The value of the global
$\chi^2$ per number of degrees of freedom (ndf) of the Kalman filter used for fitting the track
[17], assuming the charged pion mass, can also be used to identify charged particles. This
approach relies on the knowledge of the detector material and the local spatial resolution, and
exploits the known physics of multiple scattering and energy loss; it can be used to enhance or
suppress a specific particle type. The quantity $x = \sqrt{\chi^2/\mathrm{ndf}}$ has an
approximately Gaussian distribution with mean value 1 and standard deviation
$\sigma \approx 1/\sqrt{2\,\mathrm{ndf}}$ if the fitted track is indeed a pion. If it is not,
both the mean and $\sigma$ are larger by a factor $\beta(m_0)/\beta(m)$, where $m_0$ is the pion
mass and $m$ is the particle mass. Three classes were defined such that each contains an equal
number of genuine pions. The condition $x - 1 < -0.43\sigma$ favors pions, and the requirements
$-0.43\sigma < x - 1 < 0.43\sigma$ and $x - 1 > 0.43\sigma$ enhance kaons and protons,
respectively. An example of $\log\varepsilon$ distributions in a $\chi^2/\mathrm{ndf}$ slice,
with the corresponding fits, is shown in the right panel of Fig. 6. The increase of the kaon and
proton yields with increasing $x$ is visible, when compared to pions.
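
The three-class selection described in paragraph b) can be illustrated with a minimal sketch. The function names `chi_class` and `central_fraction` and the class labels are hypothetical and are not taken from the analysis code; only the variable $x$, the width $\sigma \approx 1/\sqrt{2\,\mathrm{ndf}}$, and the $0.43\sigma$ boundaries come from the text.

```python
import math

def chi_class(chi2, ndf, cut=0.43):
    """Assign a track to one of three classes from its track-fit chi2 and ndf."""
    x = math.sqrt(chi2 / ndf)
    sigma = 1.0 / math.sqrt(2.0 * ndf)  # expected spread of x for a genuine pion
    z = (x - 1.0) / sigma
    if z < -cut:
        return "pion-favored"
    if z <= cut:
        return "kaon-enhanced"
    return "proton-enhanced"

def central_fraction(cut=0.43):
    """Fraction of a unit Gaussian with |z| < cut."""
    return math.erf(cut / math.sqrt(2.0))
```

For a Gaussian, `central_fraction()` is close to one third, which is why boundaries at $\pm 0.43\sigma$ give three classes with approximately equal numbers of genuine pions.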

c) Difference of hit losses. The $n_\mathrm{hits}$ distribution depends on the particle species,
with pions producing more hits than other particles. Furthermore, the $n_\mathrm{hits}$
distributions of two particle types are related to each other. Let $f_n$ denote the number of
particles of type $f$ with $n$ hits ($n \geq 1$), in an $(\eta, p_\mathrm{T})$ bin. Let us assume
that another particle species $g$ produces fewer hits, i.e. has a higher probability of hit loss
$q$, taken to be roughly independent of the hit position along the track. The distribution of the
number of hits $g_k$ can then be predicted, with

\[ g_k = r\,(1 - q)^k \left[ f_k + q \sum_{n=k+1}^{\infty} f_n \right], \]

where $r$ is the ratio of particle abundances ($g/f$). The hit loss (compared to pions) is
primarily a function of momentum. At lower momenta, the best value of $q$ can be estimated for
each $(\eta, p_\mathrm{T})$ bin by comparing the measured kaon or proton distributions to the
ones predicted with the pion $n_\mathrm{hits}$ distribution according to the formula above. An
example of the $n_\mathrm{hits}$ distributions and the corresponding fits is shown in the left
panel of Fig. 7. The resulting values of $q$ as a function of $p$ are shown in the right panel of
Fig. 7 for the kaon-pion and proton-pion pairs. The data points with $q < 0.2$ can be
approximated with a sum of two exponentials in $p$. This can be motivated by the decay in flight
for kaons, but also by the increase of multiple Coulomb scattering with decreasing momentum. The
relation between the $n_\mathrm{hits}$ distributions of two particle types has very important
consequences: since the number of charged particles at each $n_\mathrm{hits}$ value is known,
only the local ratio $r$ of particle abundances (K/$\pi$, p/$\pi$) has to be determined from the
fits.
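
The hit-loss relation of paragraph c) can be evaluated directly. The sketch below is illustrative only: `predict_hits` is a hypothetical helper, the reference distribution is passed as a plain list, and the infinite sum is truncated at the largest hit count present in that list.

```python
def predict_hits(f, q, r=1.0):
    """Predicted n_hits distribution g_k of a species with extra hit loss q.

    f[i] is the number of reference particles (e.g. pions) with n = i + 1 hits;
    the returned list uses the same indexing, g[i] <-> k = i + 1.
    """
    n_max = len(f)
    g = []
    for k in range(1, n_max + 1):
        tail = sum(f[k:])  # sum of f_n over n = k+1 ... n_max
        g.append(r * (1.0 - q) ** k * (f[k - 1] + q * tail))
    return g
```

With `q = 0` the prediction reduces to `r * f`, as expected when the two species lose hits at the same rate. In a fit, `q` (and `r`) would be varied until the prediction matches the measured kaon or proton distribution.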

Figure 7: Left: example of extracted $n_\mathrm{hits}$ distributions (symbols) of pions, kaons,
and protons, for the 2.76 TeV dataset at $\eta = 0.35$, $p_\mathrm{T} = 0.875$ GeV/c, and
corresponding fits (curves, see Section 5.1 paragraph c). Right: probability of additional hit
loss $q$ with respect to pions as a function of total momentum $p$ in the range $|\eta| < 1$ for
positive kaons and protons, for the 2.76 TeV dataset, if the track-fit $\chi^2/\mathrm{ndf}$
value is in the lowest slice. In order to exclude regions of crossing $\log\varepsilon$ bands,
values are not shown if $p > 1.1$ GeV/c for kaons, and $1.1 < p < 1.3$ GeV/c for protons. These
points were also omitted in the double-exponential fit.

d) Continuity of parameters. In some $(\eta, p_\mathrm{T})$ bins the track-level corrections
(scale factors and shifts) are difficult to determine. These parameters are expected to change
smoothly as the kinematical region varies. The fit parameters are therefore smoothed by taking
the median of the $(\eta, p_\mathrm{T})$ bin and its 8 neighbors.
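
The median smoothing of paragraph d) amounts to a 3x3 median filter on the $(\eta, p_\mathrm{T})$ grid. The following is a sketch under one stated assumption: bins at the border of the grid simply use the neighbors that exist, which the text does not specify. The name `median_smooth` is hypothetical.

```python
from statistics import median

def median_smooth(grid):
    """Replace each bin by the median of itself and its (up to 8) neighbours.

    grid[i][j] holds a fitted parameter (scale factor or shift) in eta bin i
    and pT bin j. Border handling is an assumption: missing neighbours are
    left out of the median.
    """
    n_rows, n_cols = len(grid), len(grid[0])
    out = [[0.0] * n_cols for _ in range(n_rows)]
    for i in range(n_rows):
        for j in range(n_cols):
            window = [grid[a][b]
                      for a in range(max(0, i - 1), min(n_rows, i + 2))
                      for b in range(max(0, j - 1), min(n_cols, j + 2))]
            out[i][j] = median(window)
    return out
```

A median is used rather than a mean so that a single badly determined bin does not pull its neighbors away from the smooth trend.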

e) Convergence of parameters. While the track-level corrections are independent, they should
converge to similar values at a momentum, $p_c$, where the $\varepsilon$ values are the same for
two particle types, although the energy deposit distributions can be slightly different. These
momenta are $p_c = 1.56$ GeV/c for the pion-kaon and 2.58 GeV/c for the pion-proton pair. The
differences of fitted scale factors and shifts were studied as a function of
$\Delta\log\varepsilon$, in narrow $\eta$ slices. The parameter values were determined in the
ranges $0.50 < p < 1.00$ GeV/c for kaons and $1.30 < p < 1.65$ GeV/c for protons. In these
regions, the parameters were fitted and extrapolated to $p_c$. At $p_c$, the scale factors are
expected to be the same and their $\Delta\log\varepsilon$ dependence is well described with
first-order (proton-pion) or second-order polynomials (kaon-pion), in each $\eta$ slice
separately. More freedom had to be allowed for the shifts. While their $\Delta\log\varepsilon$
dependence can be described with first-order polynomials, their difference is not required to
converge to 0, but to a second-order polynomial of $\eta$.

5.2 Determination of yields 

In summary, in a given $(\eta, p_\mathrm{T})$ bin, the free parameters are: the scale factors
(usually in the range 0.98-1.02) and the shifts (from $-0.01$ to 0.01) for track-level
corrections; the yields of particles for each $\chi^2/\mathrm{ndf}$ bin or their ratios if the
relationship between the $n_\mathrm{hits}$ distributions of different particle species is used.
The fit was performed simultaneously in all $(n_\mathrm{hits}, \chi^2/\mathrm{ndf})$ bins with
nested minimizations. The optimization of the parameters was carried out with the Simplex
package [18], but the determination of local particle yields was performed with the
log-likelihood merit function (Eq. (1)).

Figure 8: Example $\log\varepsilon$ distributions at $\eta = 0.35$ in some selected bins, for the
7 TeV dataset. The details of the template fits are discussed in the text. Scale factors
($\alpha$) and shifts ($\delta$) are indicated. The insets show the distributions with
logarithmic vertical scale.

In order to obtain a stable result, the fits were carried out in several passes, each containing
iterative steps. After each step, the resulting scale factors and shifts were the new starting
points for the next iteration. In the first pass, $\log\varepsilon$ distributions in narrow
momentum slices were fitted using the enhanced electron, pion, proton, and kaon samples, as
defined in Section 5. The fitted parameters were then used for a fit in the same slices of the
inclusive dataset. In this way the scale factors and shifts were estimated as a function of $p$.
In the second pass, the $\log\varepsilon$ distributions in each $(\eta, p_\mathrm{T})$ bin were
fitted. The $\eta$ bins are 0.1 units wide and cover the range $-2.4 < \eta < 2.4$. The
$p_\mathrm{T}$ bins are 0.05 GeV/c wide and cover the range $p_\mathrm{T} < 2$ GeV/c. The latter
choice reflects the $p_\mathrm{T}$ resolution (0.015-0.025 GeV/c). The procedure was repeated
with the enhanced samples, followed again by the inclusive sample. The $n_\mathrm{hits}$
distributions were used to extract the relationship between different particle species, and this
relationship was used in all subsequent steps. The shifts were determined and constrained first,
and then the scale factors were obtained. Example fits are shown in Fig. 8. In the last pass all
parameters were kept constant and the final normalised $\log\varepsilon$ templates for each
particle species were extracted and used to measure the particle yields.

The results of the fitting sequence are the yields for each particle species and charge, both
inclusive and divided into track multiplicity bins. While the yields are flat in $\eta$, they
decrease with increasing $p_\mathrm{T}$, as expected. At the end of the fitting sequence the
$\chi^2/\mathrm{ndf}$ values are usually close to unity, except for some low-$p_\mathrm{T}$ fits.
At low $p$ the pions are well fitted, and the different species are well separated. Hence,
instead of fitting kaon or proton yields, it is sufficient to count the number of entries above
the fitted shape of the pion distribution.

Table 4: Momentum ranges used in various steps and procedures of the analysis. Total momentum
values are given in GeV/c. The use of hit loss and parameter convergence is with respect to
$\pi$ for K, and $\pi$+K for p.

Particle   Count             Fit               Hit loss    Convergence       Physics
e          -                 p < 0.15          -           -                 0.10 < p < 0.15
$\pi$      -                 p < 1.30          -           0.95 < p < 1.30   0.10 < p < 1.20
$\pi$+K    -                 1.30 < p < 1.95   -           -                 1.05 < p < 1.50
K          0.12 < p < 0.27   0.20 < p < 1.30   p > 0.70    0.95 < p < 1.30   0.20 < p < 1.05
p          0.27 < p < 0.70   0.30 < p < 1.95   p > 1.45    1.60 < p < 1.95   0.35 < p < 1.70

Table 4 summarizes the particle-specific momentum ranges for the following procedures: counting
the yields (Count); using a particle species in the fits (Fit, paragraphs a and b in Section
5.1); using the correspondence between hit losses in the fits (Hit loss, paragraph c); using the
principle of convergence for track-level corrections in the fits (Convergence, paragraph e); and
using the fitted yields for physics (Physics). The use of these ranges limits the systematic
uncertainties at high momentum. The ranges, after evaluation of the individual fits, were set
such that the systematic uncertainty of the measured yields does not exceed 10%. For
$p > 1.30$ GeV/c, pions and kaons were not fitted separately, but were regarded as one particle
species ($\pi$+K row in Table 4). In fact, fitted pion and kaon yields were not used for
$p > 1.20$ GeV/c and $p > 1.05$ GeV/c, respectively. Although pion and kaon yields cannot be
determined in this high-momentum region, their sum can be measured. This information is an
important constraint when fitting the $p_\mathrm{T}$ spectra (Section 7).

The statistical uncertainties for the extracted yields are given by the fits. The observed local
$(\eta, p_\mathrm{T})$ variations of parameters for track-level corrections cannot be attributed
to statistical fluctuations and indicate that the average systematic uncertainties of the scale
factors and shifts are about $10^{-2}$ and $2 \cdot 10^{-3}$, respectively. The systematic
uncertainties on the yields in each bin were obtained by refitting the histograms with the
parameters changed by these amounts.



6 Corrections 

The measured yields in each $(\eta, p_\mathrm{T})$ bin, $\Delta N_\mathrm{measured}$, were first
corrected for the misreconstructed-track rate ($C_f$, Section 3.2) and the fraction of
secondaries ($C_s$, Section 3.3):

\[ \Delta N' = \Delta N_\mathrm{measured} \cdot (1 - C_f) \cdot (1 - C_s). \qquad (2) \]

Bins in which the misreconstructed-track rate was larger than 0.1 or the fraction of secondaries 
was larger than 0.25 were rejected. 
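
As an illustration of Eq. (2) together with the bin rejection just described, a minimal sketch follows. The function name `correct_bin` and the use of `None` to flag rejected bins are choices made for this example only.

```python
def correct_bin(n_measured, c_f, c_s, max_cf=0.1, max_cs=0.25):
    """Apply Eq. (2) to one bin; return None if the bin fails the quality cuts.

    c_f: misreconstructed-track rate, c_s: fraction of secondaries.
    """
    if c_f > max_cf or c_s > max_cs:
        return None
    return n_measured * (1.0 - c_f) * (1.0 - c_s)
```

For example, a bin with 1000 measured entries, $C_f = 0.02$ and $C_s = 0.05$ keeps $1000 \cdot 0.98 \cdot 0.95 = 931$ entries.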

The distributions were then unfolded to take into account the finite $\eta$ and $p_\mathrm{T}$
resolutions. The $\eta$ distribution of the tracks is flat and the $\eta$ resolution is very
good. Conversely, the $p_\mathrm{T}$ distribution is steep in the low-momentum region and
separate corrections in each $\eta$ bin were necessary. In addition, the reconstructed
$p_\mathrm{T}$ distributions for kaons and protons, at very low $p_\mathrm{T}$, are shifted with
respect to the generated distributions by about 0.025 GeV/c. This bias is a consequence of using
the pion mass for all charged particles (see Section 5.1). A straightforward unfolding procedure
with linear regularization [16] was used, based on response matrices $R$ obtained from MC samples
for each particle species. With $o$ and $m$ denoting the vectors of original and measured
differential yields ($d^2N/d\eta\,dp_\mathrm{T}$), the sum of the chi-squared term
$(Ro - m)^T V^{-1} (Ro - m)$ and a regularizer term $\lambda\, o^T H o$ is minimized by varying
$o$, where $H$ is a tridiagonal matrix. The covariance of measured values is approximated by
$V_{ij} \approx m_i \delta_{ij}$, where $\delta_{ij}$ is Kronecker's delta. The value of
$\lambda$ is adjusted such that the minimized sum of the two terms equals the number of degrees
of freedom. In practice the parameter $\lambda$ is small, of the order of $10^{-5}$.
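
The minimization above is quadratic in $o$, so for fixed $\lambda$ it reduces to the linear system $(R^T V^{-1} R + \lambda H)\,o = R^T V^{-1} m$. The sketch below solves it in pure Python. Several points are assumptions of this example rather than details from the text: the specific tridiagonal $H$ (a first-difference penalty), a square response matrix, strictly positive measured values, and the helper names `solve` and `unfold`. The adjustment of $\lambda$ to match the number of degrees of freedom is not implemented.

```python
def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            w = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= w * m[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def unfold(R, meas, lam):
    """Minimise (Ro-m)^T V^-1 (Ro-m) + lam * o^T H o with V_ii = m_i.

    R[k][i] is the probability that a particle generated in bin i is
    reconstructed in bin k (square matrix assumed here).
    """
    n = len(meas)
    v_inv = [1.0 / mk for mk in meas]  # diagonal covariance, V_ii = m_i
    # Assumed H: penalty sum_i (o[i+1] - o[i])^2, which is tridiagonal.
    H = [[0.0] * n for _ in range(n)]
    for i in range(n - 1):
        H[i][i] += 1.0
        H[i + 1][i + 1] += 1.0
        H[i][i + 1] -= 1.0
        H[i + 1][i] -= 1.0
    # Normal equations: (R^T V^-1 R + lam H) o = R^T V^-1 m
    A = [[sum(R[k][i] * v_inv[k] * R[k][j] for k in range(n)) + lam * H[i][j]
          for j in range(n)] for i in range(n)]
    b = [sum(R[k][i] * v_inv[k] * meas[k] for k in range(n)) for i in range(n)]
    return solve(A, b)
```

With `lam = 0` and an invertible `R` the result is the plain matrix inversion; a small positive `lam` damps bin-to-bin oscillations at the cost of a slightly larger chi-squared term.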



The corrected yields were obtained by applying corrections (cf. Section 3.2) for acceptance
($C_a$), efficiency ($C_e$), and multiple reconstruction rate ($C_m$):

\[ \frac{1}{N_\mathrm{ev}} \left. \frac{d^2N}{d\eta\, dp_\mathrm{T}} \right|_\mathrm{corrected}
   = \frac{1}{C_a \cdot C_e \cdot (1 + C_m)}\,
     \frac{\Delta N'}{N_\mathrm{ev}\, \Delta\eta\, \Delta p_\mathrm{T}}, \qquad (3) \]

where $N_\mathrm{ev}$ is the corrected number of DS events (see Section 3). Bins with acceptance
smaller than 0.5, efficiency smaller than 0.5, or multiple-track rate greater than 0.1 were
rejected.

Finally, the differential yields $d^2N/d\eta\,dp_\mathrm{T}$ were transformed to invariant yields
as a function of the rapidity $y$ by multiplying by the Jacobian $E/p$, and the
$(\eta, p_\mathrm{T})$ bins were mapped into a $(y, p_\mathrm{T})$ grid. The invariant yields
$1/N_\mathrm{ev}\; d^2N/dy\,dp_\mathrm{T}$ as a function of $p_\mathrm{T}$ were obtained by
averaging over $y$ in the range $-1 < y < 1$. They are largely independent of $y$ in the narrow
region considered, as expected.
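
The change of variables from $\eta$ to $y$ uses only standard kinematics. The sketch below, in units with $c = 1$, shows how the rapidity and the Jacobian $E/p$ follow from $\eta$, $p_\mathrm{T}$ and an assumed particle mass; the function names are hypothetical.

```python
import math

def rapidity(eta, pt, mass):
    """Rapidity y for pseudorapidity eta, transverse momentum pt and mass (GeV)."""
    pz = pt * math.sinh(eta)
    energy = math.sqrt(mass**2 + (pt * math.cosh(eta))**2)
    return 0.5 * math.log((energy + pz) / (energy - pz))

def jacobian(eta, pt, mass):
    """E/p = d(eta)/dy, which converts d2N/(d eta dpT) into d2N/(dy dpT)."""
    p = pt * math.cosh(eta)
    return math.sqrt(mass**2 + p**2) / p
```

For a massless particle $y = \eta$ and the Jacobian is 1; the correction is largest for protons at low $p_\mathrm{T}$, where $E$ is dominated by the mass.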

6.1 Systematic uncertainties 

The systematic uncertainties are summarized in Table 5; they are subdivided in three categories.

The uncertainties of the corrections related to the event selection (Section 3.1) and pileup
(Section 3.3) are fully or mostly correlated and were treated as normalisation uncertainties.
They amount to a 3.0% systematic uncertainty on the yields and 1.0% on the average
$p_\mathrm{T}$.

The pixel hit efficiency and the effects of a possible misalignment of the detector elements are
mostly uncorrelated. Their contribution to the yield uncertainty is about 0.3%.

Other mostly uncorrelated systematic effects are the following: the tracker acceptance and the
track reconstruction efficiency (Section 3.2) generally have small uncertainties (1% and 2%,
respectively), but change rapidly at very low $p_\mathrm{T}$, leading to a 5-6% uncertainty on
the yields in that range; for the multiple-track and misreconstructed-track rate corrections
(Section 3.2), the uncertainty is assumed to be 50% of the correction, while for the case of the
correction for secondary particles it is 20% (Section 3.3). The uncertainty of the fitted yields
(Section 5.2) also belongs to this category.

In the weighted averages and the fits discussed in the following, the quadratic sum of statistical 
and systematic uncertainties (referred to as combined uncertainty) is used. The fully correlated 
systematic uncertainties (event selection and pileup) are not displayed in the plots. 
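
The combined uncertainty is the quadratic sum of the statistical and systematic parts. The sketch below shows this, together with one possible weighted average; the use of inverse-variance weights is an assumption of this example, since the text does not state the weighting scheme, and both function names are hypothetical.

```python
import math

def combined(stat, syst):
    """Quadratic sum of statistical and systematic uncertainties."""
    return math.hypot(stat, syst)

def weighted_average(values, stats, systs):
    """Average with weights 1/combined^2 (assumed inverse-variance weighting)."""
    weights = [1.0 / combined(s, t) ** 2 for s, t in zip(stats, systs)]
    total = sum(weights)
    mean = sum(w * v for w, v in zip(weights, values)) / total
    return mean, 1.0 / math.sqrt(total)
```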



Table 5: Summary of the systematic uncertainties on the spectra. Values in parentheses indicate
uncertainties on the $\langle p_\mathrm{T}\rangle$ measurement. Representative, particle-specific
uncertainties ($\pi$, K, p) are shown at $p_\mathrm{T} = 0.6$ GeV/c.

Source                                           Uncertainty of      Propagated yield
                                                 the source [%]      uncertainty [%]

Fully correlated, normalisation                                      3.0 (1.0)
  Correction for event selection                 3.0 (1.0)
  Pileup correction (merged and split vertices)  0.3

Mostly uncorrelated                                                  0.3
  Pixel hit efficiency                           0.3
  Misalignment, different scenarios              0.1

Mostly uncorrelated, (y, pT) dependent                               $\pi$    K       p
  Acceptance of the tracker                      1-6                 1        1       1
  Efficiency of the reconstruction               2-5                 2        2       2
  Multiple-track reconstruction                  50% of the corr.    -        -       -
  Misreconstructed-track rate                    50% of the corr.    <0.5     <0.5    0.5
  Correction for secondary particles             20% of the corr.    <0.5     -       2
  Fitting $\log\varepsilon$ distributions        1-10                1        2       1

7 Results

In previously published measurements of unidentified and identified particle spectra, the
following form of the Tsallis-Pareto-type distribution [19, 20] was fitted to the data:

\[ \frac{d^2N}{dy\, dp_\mathrm{T}} = \frac{dN}{dy} \cdot C \cdot p_\mathrm{T}
   \left[ 1 + \frac{m_\mathrm{T} - m}{nT} \right]^{-n}, \qquad (4) \]

where

\[ C = \frac{(n - 1)(n - 2)}{nT\,[nT + (n - 2)m]} \qquad (5) \]

and $m_\mathrm{T} = \sqrt{m^2 + p_\mathrm{T}^2}$ (c factors are omitted from the preceding
formulae). The free parameters are the integrated yield $dN/dy$, the exponent $n$, and the
inverse slope $T$. The above formula is useful for extrapolating the spectra to
$p_\mathrm{T} = 0$, and for extracting $\langle p_\mathrm{T}\rangle$ and $dN/dy$. Its validity in
the present analysis was cross-checked by fitting MC spectra and verifying that the fitted values
of $\langle p_\mathrm{T}\rangle$ and $dN/dy$ were consistent with the generated values. According
to some models of particle production based on non-extensive thermodynamics [20], the parameter
$T$ is connected with the average particle energy, while $n$ characterizes the "non-extensivity"
of the process, i.e. the departure of the spectra from a Boltzmann distribution.
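
The Tsallis-Pareto-type form and its normalization constant can be checked numerically. The sketch below is illustrative: the parameter values used in the usage note are invented for the example and are not fit results from this analysis, and the helper names (`tsallis`, `simpson`, `yield_and_mean_pt`) and the finite upper integration limit are choices of this example.

```python
import math

def tsallis(pt, dndy, n, T, mass):
    """Eq. (4): d2N/(dy dpT) for the Tsallis-Pareto-type form (units with c = 1)."""
    mt = math.sqrt(mass**2 + pt**2)
    norm = (n - 1.0) * (n - 2.0) / (n * T * (n * T + (n - 2.0) * mass))  # Eq. (5)
    return dndy * norm * pt * (1.0 + (mt - mass) / (n * T)) ** (-n)

def simpson(func, a, b, steps=100000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0

def yield_and_mean_pt(dndy, n, T, mass, pt_max=50.0):
    """Integrated yield and <pT> from numerical integration of Eq. (4).

    pt_max (GeV/c) stands in for infinity; the neglected tail is tiny for n > 3.
    """
    f = lambda pt: tsallis(pt, dndy, n, T, mass)
    total = simpson(f, 0.0, pt_max)
    mean = simpson(lambda pt: pt * f(pt), 0.0, pt_max) / total
    return total, mean
```

With the constant $C$ of Eq. (5) the integral over $p_\mathrm{T}$ returns $dN/dy$, e.g. `yield_and_mean_pt(2.0, 7.0, 0.13, 0.1396)` gives a yield close to 2.0. In the massless limit the mean has the closed form $\langle p_\mathrm{T}\rangle = 2nT/(n-3)$, which provides a simple cross-check of the integration.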

As discussed earlier, pions and kaons cannot be unambiguously distinguished at higher momenta
(Section 5.2). Because of this, the pion-only (kaon-only) $d^2N/dy\,dp_\mathrm{T}$ distribution
was fitted for $|y| < 1$ and $p < 1.20$ GeV/c ($p < 1.05$ GeV/c); the joint pion and kaon
distribution was instead fitted if $|\eta| < 1$ and $1.05 < p < 1.5$ GeV/c. Since the ratio $p/E$
for the pions (which are more abundant than kaons) at these momenta can be approximated by
$p_\mathrm{T}/m_\mathrm{T}$ at $\eta \approx 0$, Eq. (4) becomes:

\[ \frac{d^2N}{d\eta\, dp_\mathrm{T}} = \frac{dN}{dy} \cdot C \cdot
   \frac{p_\mathrm{T}^2}{m_\mathrm{T}}
   \left[ 1 + \frac{m_\mathrm{T} - m}{nT} \right]^{-n}. \qquad (6) \]

In the case of pions and protons, the measurements cover a wide $p_\mathrm{T}$ range: the yields
and average $p_\mathrm{T}$ can thus be determined with small systematic uncertainty. For the
kaons the number of measurements is small and the $p_\mathrm{T}$ range is limited. Therefore, for
the combined pion and kaon fits, the kaon component was weighted by a factor of four, leading to
the following function to be minimized: $\chi^2_\pi + \chi^2_{\pi+\mathrm{K}} + 4\chi^2_\mathrm{K}$.
This weight accounts for the $p_\mathrm{T}$ range, which is narrower by a factor of about two,
and also for the partial correlation between the pion measurement and that of the sum of pions
and kaons, which gives another factor of two.

The average transverse momentum $\langle p_\mathrm{T}\rangle$ and its uncertainty were obtained
by numerical integration of Eq. (4) with the fitted parameters.

The results discussed in the following are for $|y| < 1$ at $\sqrt{s} = 0.9$, 2.76, and 7 TeV. In
all cases, error bars indicate the uncorrelated statistical uncertainties, while bands show the
uncorrelated systematic uncertainties. The fully correlated normalisation uncertainty (not shown)
is 3.0%. For the $p_\mathrm{T}$ spectra, the average transverse momentum, and the ratio of
particle yields, the data are compared to the D6T and Z2 tunes of PYTHIA6 as well as to the 4C
tune of PYTHIA8 [21].

7.1 Inclusive measurements 

The transverse momentum distributions of positive and negative hadrons (pions, kaons, protons)
are shown in Fig. 9, along with the results of the fits to the Tsallis-Pareto parametrization
(Eqs. (4) and (6)). The fits are of good quality with $\chi^2/\mathrm{ndf}$ values in the range
0.6-1.5 for pions, 0.6-2.1 for kaons, and 0.4-1.1 for protons. Figure 10 presents the data
compared to various PYTHIA tunes. Tunes D6T and 4C tend to be systematically below or above the
spectra, whereas Z2 is generally closer to the measurements (except for low-$p_\mathrm{T}$
protons).

Ratios of particle yields as a function of the transverse momentum are plotted in Fig. 11. While
the p/$\pi$ ratios are well described by all tunes, there are substantial deviations for the
K/$\pi$ ratios, also seen by other experiments and at different energies. CMS measurements of
$\mathrm{K^0_S}$ and $\Lambda/\bar{\Lambda}$ production [14] are consistent with the
discrepancies seen here. The ratios of the yields for oppositely charged particles are close to
one, as expected for pair-produced particles at midrapidity. Ratios for pions and kaons are
compatible with unity, independently of $p_\mathrm{T}$. While the $\bar{p}/p$ ratios are also
flat as a function of $p_\mathrm{T}$, they increase with increasing $\sqrt{s}$.

7.2 Multiplicity-dependent measurements 

This study is motivated by the intriguing hadron correlations measured in pp collisions at
high track multiplicities [22], which suggest possible collective effects in "central" pp
collisions at the LHC. In addition, the multiplicity dependence of particle yield ratios is
sensitive to various final-state effects (hadronization, color reconnection, collective flow)
implemented in MC models used in collider and cosmic-ray physics [23].



Twelve event classes were defined, each with a different number of reconstructed particles:
$N_\mathrm{rec}$ = (0-9), (10-19), (20-29), ..., (100-109), and (110-119), as shown in Table 6.
In order to facilitate comparisons with models, the corresponding true track multiplicity in the
range $|\eta| < 2.4$ ($N_\mathrm{tracks}$) was determined from the simulation. The average
$\langle N_\mathrm{tracks}\rangle$ values, given in Table 6, are used in the plots presented in
the following. The results in the table were found to be independent of the center-of-mass energy
and the PYTHIA tune.

Figure 9: Transverse momentum distributions of identified charged hadrons (pions, kaons,
protons) in the range $|y| < 1$, for positive (left) and negative (right) particles, at
$\sqrt{s} = 0.9$, 2.76, and 7 TeV (from top to bottom). Kaon and proton distributions are scaled
as shown in the legends. Fits to the Tsallis-Pareto parametrization are superimposed. Error bars
indicate the uncorrelated statistical uncertainties, while bands show the uncorrelated systematic
uncertainties. The fully correlated normalisation uncertainty (not shown) is 3.0%.

Figure 10: Transverse momentum distributions of identified charged hadrons (pions, kaons,
protons) in the range $|y| < 1$, for positive (left) and negative (right) particles, at
$\sqrt{s} = 0.9$, 2.76, and 7 TeV (from top to bottom). Measured values (same as in Fig. 9) are
plotted together with predictions from PYTHIA6 (D6T and Z2 tunes) and the 4C tune of PYTHIA8.
Error bars indicate the uncorrelated statistical uncertainties, while bands show the uncorrelated
systematic uncertainties. The fully correlated normalisation uncertainty (not shown) is 3.0%.

Figure 11: Ratios of particle yields as a function of transverse momentum, at $\sqrt{s} = 0.9$,
2.76, and 7 TeV (from top to bottom). Error bars indicate the uncorrelated statistical
uncertainties, while boxes show the uncorrelated systematic uncertainties. Curves indicate
predictions from PYTHIA6 (D6T and Z2 tunes) and the 4C tune of PYTHIA8.

Table 6: Relationship between the number of reconstructed tracks ($N_\mathrm{rec}$) and the
average number of true tracks ($\langle N_\mathrm{tracks}\rangle$) in the 12 multiplicity classes
considered.

$N_\mathrm{rec}$                   0-9  10-19  20-29  30-39  40-49  50-59  60-69  70-79  80-89  90-99  100-109  110-119
$\langle N_\mathrm{tracks}\rangle$   7     16     28     40     52     63     75     86     98    109      120      131

The normalized transverse-momentum distributions of identified charged hadrons in selected
multiplicity classes, for $|y| < 1$ and $\sqrt{s} = 0.9$, 2.76, and 7 TeV, are shown in Figs. 12,
13, and 14 for pions, kaons, and protons, respectively. The distributions of negatively and
positively charged particles have been summed. The distributions are fitted to the Tsallis-Pareto
parametrization. In the case of pions, the distributions are remarkably similar, and essentially
independent of $\sqrt{s}$ and multiplicity. For kaons and protons, there is a clear evolution as
the multiplicity increases. The inverse slope parameter $T$ increases with multiplicity for both
kaons and protons, while the exponent $n$ is independent of the multiplicity (not shown in the
figures).



The ratios of particle yields as a function of track multiplicity are displayed in Fig. 15. The
K/$\pi$ and p/$\pi$ ratios are flat as a function of $N_\mathrm{tracks}$. Although the trend at
low $N_\mathrm{tracks}$ is not reproduced by any of the tunes, the values are approximately
correct for tunes D6T and Z2, while 4C is off, especially for K/$\pi$. The ratios of yields of
oppositely charged particles are independent of $N_\mathrm{tracks}$.



The average transverse momentum $\langle p_\mathrm{T}\rangle$ is shown as a function of
multiplicity in Fig. 16. The plots are similar, and largely independent of $\sqrt{s}$, for all
the particle species studied. Pions and kaons are well described by the Z2 and 4C tunes, while
D6T predicts values that are too high at high multiplicities. None of the tunes provide an
acceptable description of the multiplicity dependence of $\langle p_\mathrm{T}\rangle$ for
protons, and the measured values lie between D6T and Z2. For the dependence of $T$ on
multiplicity (not shown in the figures), the predictions are consistently higher than the pion
data for all tunes; the kaon and proton data are again between D6T and Z2, somewhat closer to the
latter. Tune 4C gives a flat multiplicity dependence for $T$ and is not favored by the kaon and
proton measurements.

The center-of-mass energy dependence of $dN/dy$, the average transverse momentum
$\langle p_\mathrm{T}\rangle$, and the particle yield ratios are shown in Fig. 17. For $dN/dy$,
the Z2 tune gives the best overall description. The $\langle p_\mathrm{T}\rangle$ of pions is
reproduced by tune 4C, that of the kaons is best described by Z2, and that of the protons is not
reproduced by any of the tunes, with D6T closest to the data. The ratios of the yields for
oppositely charged mesons are independent of $\sqrt{s}$ and have values of about 0.98 for the
pions; the kaon ratios are compatible with those of the pions and also with unity. The slight
deviation from unity observed for the pions probably reflects the initial charge asymmetry of pp
collisions. The $\bar{p}/p$ yield ratio appears to increase with $\sqrt{s}$, though it is
difficult to draw definite conclusions because of the large systematic uncertainties. The
K/$\pi$ and p/$\pi$ ratios are flat as a function of $\sqrt{s}$, and have values of 0.13 and
0.06-0.07, respectively. The exponent $n$ (not shown in the figures) decreases with increasing
$\sqrt{s}$ for pions and protons. For the kaons the systematic uncertainties are too large to
draw a definite conclusion. The inverse slope $T$ (also not shown) is flat as a function of
$\sqrt{s}$ for the pions but exhibits a slight increase for the protons. The universality of the
relation of $\langle p_\mathrm{T}\rangle$ and the particle-yield ratios with the track
multiplicity, and its independence of the collision energy, is demonstrated in Fig. 18.



Figure 12: Normalized transverse momentum distributions of charged pions in a few representative
multiplicity classes, in the range $|y| < 1$, at $\sqrt{s} = 0.9$, 2.76, and 7 TeV, fitted to the
Tsallis-Pareto parametrization (solid lines). For better visibility, the result for any given
$\langle N_\mathrm{tracks}\rangle$ bin is shifted by 0.5 units with respect to the adjacent bins.
Error bars indicate the uncorrelated statistical uncertainties, while bands show the uncorrelated
systematic uncertainties.

The transverse-momentum distributions of identified charged hadrons at central rapidity are
compared to those of the ALICE Collaboration [24] at $\sqrt{s} = 0.9$ TeV in Fig. 19 ($|y| < 1$
for CMS, $|y| < 0.5$ for ALICE). While the rapidity coverage is different, the measurements can
be compared because the $p_\mathrm{T}$ spectra are largely independent of $y$ for $|y| < 1$. The
results from the two experiments agree well for the mesons, and exhibit some small discrepancies
for the protons.

The center-of-mass energy dependence of $dN/dy$ in the central rapidity region and the average
transverse momentum for pions, kaons, and protons are shown in Fig. 20. Measurements from
UA2 [25], E735 [26], PHENIX [27], STAR [28], ALICE [24], and CMS are shown. The observed
$\sqrt{s}$ evolution of both quantities is consistent with a power-law increase.

The central rapidity $\bar{p}/p$ ratio as a function of the rapidity interval $\Delta y$ is
displayed in Fig. 21. This quantity is defined as
$\Delta y = y_\mathrm{beam} - y_\mathrm{baryon}$, where $y_\mathrm{beam}$ ($y_\mathrm{baryon}$)
is the rapidity of the incoming beam (outgoing baryon). Measurements from ISR energies [29, 30],
NA49 [31], BRAHMS [32], PHENIX [33], PHOBOS [34], and STAR [35] are shown together with LHC
(ALICE [36] and CMS) data. The curve represents the expected $\Delta y$ dependence in a
Regge-inspired model, where baryon pair production is governed by Pomeron exchange, and baryon
transport by string-junction exchange [37]. The functional form used is
$(\bar{p}/p)^{-1} = 1 + C \exp[(\alpha_J - \alpha_P)\,\Delta y]$ with $C = 10$,
$\alpha_P = 1.2$, and $\alpha_J = 0.5$, as used in the ALICE paper. The CMS data are consistent
with previous measurements, as well as with the proposed function.
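
The functional form quoted above is simple to evaluate. The sketch below does so for baryons at midrapidity, where it is assumed that $y_\mathrm{baryon} = 0$ so that $\Delta y = y_\mathrm{beam}$; the beam rapidity is computed from standard kinematics, and the function names are hypothetical.

```python
import math

PROTON_MASS = 0.938272  # GeV/c^2

def beam_rapidity(sqrt_s):
    """Rapidity of a beam proton for center-of-mass energy sqrt_s in GeV."""
    return math.acosh(sqrt_s / (2.0 * PROTON_MASS))

def pbar_over_p(delta_y, C=10.0, alpha_P=1.2, alpha_J=0.5):
    """pbar/p from (pbar/p)^-1 = 1 + C exp[(alpha_J - alpha_P) * delta_y]."""
    return 1.0 / (1.0 + C * math.exp((alpha_J - alpha_P) * delta_y))
```

Evaluating `pbar_over_p(beam_rapidity(e))` for `e` = 900, 2760 and 7000 GeV gives a ratio that rises toward unity with increasing energy, in line with the trend described for the data.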



Figure 13: Normalized transverse momentum distributions of charged kaons in a few representative
multiplicity classes, in the range $|y| < 1$, at $\sqrt{s} = 0.9$, 2.76, and 7 TeV, fitted to the
Tsallis-Pareto parametrization (solid lines). For better visibility, the result for any given
$\langle N_\mathrm{tracks}\rangle$ bin is shifted by 0.5 units with respect to the adjacent bins.
Error bars indicate the uncorrelated statistical uncertainties, while bands show the uncorrelated
systematic uncertainties.

8 Conclusions 

Measurements of identified charged hadrons produced in pp collisions at $\sqrt{s} = 0.9$, 2.76,
and 7 TeV have been presented, based on data collected in events with simultaneous hadronic
activity at pseudorapidities $-5 < \eta < -3$ and $3 < \eta < 5$. Charged pions, kaons, and
protons were identified from the energy deposited in the silicon tracker (pixels and strips) and
other track information (number of hits and goodness of track-fit). CMS data extend the
center-of-mass energy range of previous measurements and are consistent with them at lower
energies. Moreover, in the present analysis the data have been studied differentially, as a
function of the particle multiplicity in the event and of the collision energy. The results can
be used to further constrain models of hadron production and contribute to the understanding of
basic non-perturbative dynamics in hadronic collisions.

The measured track multiplicity dependence of the rapidity density and of the average trans- 
verse momentum indicates that particle production at LHC energies is strongly correlated with 
event particle multiplicity rather than with the center-of-mass energy of the collision. This
correlation may reflect the fact that at TeV energies the characteristics of particle production in
hadronic collisions are constrained by the amount of initial parton energy available in a given
collision.

Figure 14: Normalized transverse momentum distributions of charged protons in a few
representative multiplicity classes, in the range |y| < 1, at √s = 0.9, 2.76, and 7 TeV, fitted to the
Tsallis-Pareto parametrization (solid lines). For better visibility, the result for any given ⟨N_tracks⟩
bin is shifted by 0.5 units with respect to the adjacent bins. Error bars indicate the uncorrelated
statistical uncertainties, while bands show the uncorrelated systematic uncertainties.

Figure 15: Ratios of particle yields in the range |y| < 1 as a function of the true track
multiplicity for |η| < 2.4, at √s = 0.9, 2.76, and 7 TeV (from top to bottom). Error bars indicate
the uncorrelated combined uncertainties, while boxes show the uncorrelated systematic
uncertainties. Curves indicate predictions from PYTHIA6 (D6T and Z2 tunes) and the 4C tune of
PYTHIA8.

Figure 16: Average transverse momentum of identified charged hadrons (pions, kaons, protons)
in the range |y| < 1, for positive (left) and negative (right) particles, as a function of the
true track multiplicity for |η| < 2.4, at √s = 0.9, 2.76, and 7 TeV (from top to bottom). Error bars
indicate the uncorrelated combined uncertainties, while boxes show the uncorrelated systematic
uncertainties. The fully correlated normalisation uncertainty (not shown) is 1.0%. Curves
indicate predictions from PYTHIA6 (D6T and Z2 tunes) and the 4C tune of PYTHIA8.

Figure 17: Center-of-mass energy dependence of dN/dy, average transverse momentum ⟨pT⟩,
and ratios of particle yields. Error bars indicate the uncorrelated combined uncertainties,
while boxes show the uncorrelated systematic uncertainties. For dN/dy (⟨pT⟩) the fully correlated
normalisation uncertainty (not shown) is 3.0% (1.0%). Curves indicate predictions from
PYTHIA6 (D6T and Z2 tunes) and the 4C tune of PYTHIA8.

Figure 18: Left: average transverse momentum of identified charged hadrons (pions, kaons,
protons) in the range |y| < 1, for all particle types, as a function of the true track multiplicity
for |η| < 2.4, for all energies. Right: ratios of particle yields as a function of particle multiplicity
for |η| < 2.4, for all energies. Error bars indicate the uncorrelated combined uncertainties,
while boxes show the uncorrelated systematic uncertainties. For ⟨pT⟩ the fully correlated
normalisation uncertainty (not shown) is 1.0%. Lines are drawn to guide the eye (solid - 0.9 TeV,
dotted - 2.76 TeV, dash-dotted - 7 TeV).

Figure 19: Comparison of transverse momentum distributions of identified charged hadrons
(pions, kaons, protons) at central rapidity (|y| < 1 for CMS, |y| < 0.5 for ALICE [24]), for
positive hadrons (left) and negative hadrons (right), at √s = 0.9 TeV. To improve clarity, the
kaon and proton points are scaled by the quoted factors. Error bars indicate the uncorrelated
statistical uncertainties, while bands show the uncorrelated systematic uncertainties. In the
CMS case the fully correlated normalisation uncertainty (not shown) is 3.0%. The ALICE results
were corrected to inelastic pp collisions, and therefore the CMS points are scaled by an empirical
factor of 0.78 so as to correct for the different particle-level selection used by ALICE.

Figure 20: Comparison of the center-of-mass energy dependence of the central rapidity density
dN/dy (left) and the average transverse momentum ⟨pT⟩ (right). Low-energy data (UA2 [25],
E735 [26], PHENIX [27], STAR [28]) are shown with LHC data (ALICE [24] and CMS). For the
CMS points, the error bars indicate the uncorrelated combined uncertainties, while boxes show
the uncorrelated systematic uncertainties. The fully correlated normalisation uncertainty (not
shown) is around 3.0% (left plot) and 1.0% (right plot).

Figure 21: Comparison of the central rapidity p̄/p yield ratio as a function of the rapidity
difference Δy, plotted together with the prediction of the Regge-inspired model [37].
Measurements at low energies (ISR [29, 30], NA49 [31], BRAHMS [32], PHENIX [33], PHOBOS [34],
and STAR [35]) are shown along with LHC data (ALICE and CMS).